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Abstract 

We propose a deterministic algorithm for approximately counting the number of list colorings of 
a graph. Under the assumption that the graph is triangle free, the size of every list is at least aA, 
where a is an arbitrary constant bigger than a** = 2.8432 . . ., and A is the maximum degree of the 
graph, we obtain the following results. For the case when the size of the each list is a large constant, 
we show the existence of a deterministic FPTAS for computing the total number of list colorings. 
The same deterministic algorithm has complexity 2'^('°s "\ without any assumptions on the sizes of 
the lists, where n is the instance size. We further extend our method to a discrete Markov random 
field (MRF) model. Under certain assumptions relating the size of the alphabet, the degree of the 
graph and the interacting potential we again construct a deterministic FPTAS for computing the 
partition function of a MRF. 

Our results arc not based on the most powerful existing counting technique - rapidly mixing 
Markov chain method. Rather we build upon concepts from statistical physics, in particular, the 
decay of correlation phenomena and its implication for the uniqueness of Gibbs measures in infinite 
graphs. This approach was proposed in two recent papers [?] and [?]. The principle insight of this 
approach is that the correlation decay property can be established with respect to certain computation 
tree, as opposed to the conventional correlation decay property with respect to graph theoretic 
neighborhoods of a given node. This allows truncation of computation at a logarithmic depth in 
order to obtain polynomial accuracy in polynomial time. 

1 Introduction 

This paper is devoted to the problem of computing the total number of list colorings of a graph. It 
is further concerned with the problem of computing a partition function corresponding to a Markov 
random field (also known as graphical) model. The setting for the list coloring problem is as follows. 
Each node of a given graph is associated with a list of colors. An assignment of nodes to colors is called 
list coloring if every node is assigned to some color from its list and no two nodes sharing an edge are 
assigned to the same color. When all the lists are identical, the problem reduces to the problem of 
coloring of a graph. The problem of determining whether a list coloring exists is NP-hard, but provided 
that the size of each list is stictly larger than the degree for each node, a simple greedy algorithm 
produces a coloring. We are concerned with the corresponding counting problem - compute the total 
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number of list colorings of a given graph/list pair. This problem is known to be hard even for the 
restricted problem of counting the colorings, and the focus is on the approximation algorithms. The 
existing approximation schemes are mostly based on the rapidly mixing Markov chain technique, also 
known as Glauber dynamics approach. It was established by Jerrum [?] that the Glauber dynamics 
corresponding to graphs where the ratio of the number of colors to degree satisfies q/ IS. > 2, mixes 
rapidly. This leads to a randomized approximation algorithm for enumerating the number of colorings. 
The 2-barrier was first broken by Vigoda [?], who lowered the ratio requirement to 11/6. Many further 
significant improvements were obtained subsequently. The state of the art is summarized in [?]. For a 
while the improvement over 11/6 ratio came at a cost of lower bound Q,{\ogn) on the maximum degree, 
where n is the number of nodes. This requirement was lifted by Dyer et al. [?]. 

In this paper we focus on a different approach to the counting list colorings problem. Our setting is a 
list coloring problem. We require that the size of every list is at least aA+Z?, where a > a** = 2.8432 . . . 
- the unique solution to ae~« = 2, and /? is a large constant which depends on a — a**. Our girth 
restriction is g > 4, namely, the graph is triangle-free. We obtain the following results. First, assuming 
that the size of each list is at most a constant, we construct a deterministic Fully Polynomial Time 
Approximation Scheme (FPTAS) for the problem of computing the total number of list colorings of a 
given graph/list pair. Second, for arbitrary graph/list pair (no assumptions on the list sizes) we construct 
an approximation algorithm with complexity 2*^^^°^ ^\ Namely, our algorithm is super-polynomial but 
still significantly quicker than exponential time. 

Although our regime a > 2.8432 ... is weaker than q/A > 2, for which the Markov chain is known 
to mix rapidly, the important contribution of our method is that it provides a deterministic algorithm. 
Presently no deterministic algorithms are known for counting approximately the number of coloring of 
a graph. 

Our approach is based on establishing a certain correlation decay property which has been consid- 
ered in many settings [?], including the coloring problem [?], [?], [?],[?] and has been recently a subject 
of interest. In particular, the correlation decay has been established in [?] for coloring triangle-free 
graphs under the assumption that a > a* = 1.763. . ., the unique solution of ae~a = 1. (Some mild 
additional assumptions were adopted). The principal motivation for establishing the correlation decay 
property comes from statistical physics, in particular the connection with the uniqueness of the associ- 
ated Gibbs measure (uniform measure in our setting) on infinite versions of the graph, typically lattices. 
Recently, however, a new approach linking correlation decay to counting algorithms was proposed in 
Bandyopadhyay and Gamarnik [?] and Weitz [?]. The idea is to use correlation decay property in- 
stead of Markov sampling for computing marginals of the Gibbs (Tiniform) distribution. This leads to 
a deterministic approach since the marginals are computed using a dynamic programming like scheme 
(also known as Belief Propagation (BP) algorithm [?]). This approach typically needs a locally-tree like 
structure (large girth) [?] in order to be successful. The large girth assumption was explicitly assumed 
in [?], where the problems of computing the number of independent sets and colorings in some special 
structured (regular) graphs was considered. Weitz [?] cleverly by-passes the large girth assumption by 
using a certain self-avoiding tree construction thus essentially reducing the problem to a problem on a 
tree with careful boundary conditions implied by independent sets. This idea was used recently by Jung 
and Shah [?] to introduce a version of a BP algorithm which works on a non-locally-tree like graphs, 
where appropriate correlation decay can be established. This approach works for binary type problems 
(independent sets, matchings, Ising model) but does not apparently extend to multi-valued problems. 

In this paper we propose a general deterministic approximate counting algorithm which can be 
used for arbitrary multi-valued counting problem. We also by-pass the large girth assumption by 
considering a certain computation tree corresponding to the Gibbs (uniform for the case of colorings) 
measure. Our principal insight is establishing correlation decay for the computation tree as opposed 
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to the conventional correlation decay associated with the graph-theoretic structure of the graph. We 
provide a discussion explaining why it is crucial to establish the correlation decay in this way in order 
to obtain FPTAS. Contrast this with [?] where correlation decay is established for the coloring problem 
but in the conventional graph-theoretic distance sense. Our method is similar to the self-avoiding walk 
method of Weitz's but somewhat more direct as the step of relating the marginal probability on a 
graph to the marginal probability on the tree is by-passed in our computation tree approach. The 
advantage of establishing correlation decay on a computation tree as opposed to the original graph has 
been highlighted also in [?] in the context of BP algorithms and the Dobrushin's Uniqueness condition. 
More importantly our approach works for general, not necessarily two-valued model. We extend our 
approach to Markov random field model and also show that under some conditions the computation 
tree satisfies the correlation decay property and, as a result, one obtains a deterministic algorithm for 
computing approximately the associated partition function. 

The remainder of the paper has the following structure. The model description and the main 
result are stated in Section [2j Some preliminary technical results are established in Section [3l The 
description of the algorithm and its complexity are subject of Section [H The principal technical result 
is established in Section [5j The key result is Theorem [2l which establishes the correlation decay result 
on a computation tree arising in computing the marginals of the uniform distribution on the set of all 
list colorings. Section [6] is devoted to the extension of our approach to a Markov random field. Section [7| 
provides a brief comparison between the correlation decay on a computation tree and the correlation 
decay in a conventional sense. Some conclusions and open problems are in Section [HI 

2 Definitions and the main result 

We consider a simple graph G with the node set V = {vi,V2, ■ ■ ■ ,v\v\}- Our graph is assumed to 
be triangle-free. Namely the girth (the size of the smallest cycle) is at least 5 > 4. Let E, A denote 
respectively the set of edges and the maximum degree of the graph. A{v) denotes the degree of the node 
V. Each node v is associated with a list of colors L{v) C {1, 2, . . . , q} = Ui,gyL(t;), where {1, 2, ... , q} 
is the total universe of colors. We let L = {L{v),l < v < n) denote the vector of lists. We also let 
\\L\\ = maxt, the size of the largest list. The list-coloring problem on G is formulated as follows: 

associate each node v with a color c{v) € L{v) such that no two nodes sharing an edge are associated 
with the same color. When all the lists are identical and contain q elements, the corresponding problem 
is the problem of coloring G using q colors. We let \L{v)\ denote the cardinality of L(v). It is easy to 
see that if 



for every node v, then a simple greedy procedure produces a list-coloring. We adopt here a stronger 
assumption 



where a is an arbitrary constant strictly larger than a**, the unique solution of a** exp(— ^) = 2. That 
is a** ~ 2.8432 .... We also assume that /3 is a large constant which depends on a. To be more specific 
we assume that f3 = /3(a) is large enough to satisfy 



L{v)\ > A(t;) + 1 



(1) 



L{v)\ > aA{v)+f3, 



(2) 




f'' > 2, 



(3) 



which is always possible when a > a' 



,** 
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Let Z(G, L) denote the total number of possible list-colorings of a graph/list pair (G, L). The corre- 
sponding counting problem is to compute (approximately) Z{G,L). In statistical physics terminology, 
Z{G, L) is the partition function. We let Z{G, L, x) denote the number of list colorings of (G, L) which 
satisfy some condition x- For example Z{G, L, c{v) = i, c{u) = j) is the number of list colorings such 
that the color of f is i and the color of u is j. 

On the space of all list colorings of G we consider a uniform probability distribution, where each list 
coloring assumes weight 1/Z(G,L). For every node/color pair v £V,i € L{v), j^{c{v) = i) denotes 
the probability that node v is colored i with respect to this probability measure. The size of the instance 
corresponding to a graph/list pair(G, -L) is defined to be n = max{|y|, \E\,q}. 

Definition 1. An approximation algorithm A is defined to be a Fully Polynomial Time Approximation 
Scheme for a computing Z{G, L) if given arbitrary 6 > it produces a value Z satisfying 



in time which is polynomial in n,- 

We now state our main result. 

Theorem 1. There exist a deterministic algorithm which provides a FPTAS for computing Z{G, L) for 
arbitrary graph list pair G,L satisfying when the size of the largest list \\L\\ is constant. The same 
algorithm has complexity 2^^^°^ without any restriction on \\L\\, where n is the size of the instance. 

3 Preliminary technical results 
3.1 Basic recursion 

We begin by establishing a standard relationship between the partition function Z(G, L) and the 
marginals Pj^ ]^{c{v) = i). The relation, also known as cavity method, is also the basis of the Glauber 
dynamics approach for computing partition functions. 

Proposition 1. Consider an arbitrary list coloring ii, . . . , i^y^ of the graph G (which can be constructed 
using a simple greedy procedure). For every A; = 0,l,...,|y| — 1 consider a graph list pair G^, -L^, where 
(Go, Lq) = (G, L), Gk = G \ {vi, . . . , Vk}, k > 1 and the list L/. is obtained by deleting from each list 
L(vi), I > k a color ir, r < k if {vi,Vr) € E. Then 

z{G,L)= W iPGli:,(^(^'^) = ^'^)- 

o<fc<|y|-i 

Proof. We have 

p w ^ . ^ Z{G,L,c{vi) = ii) Z{Gl,Li) 
^CLicivi) = n) = = 

from which we obtain 

Z{G, L) = PG,z(c(t;i) = ii)-^^(Gi, Xi). 
Iterating further for > 2 we obtain the result. □ 



1-5 < 



Z 



Z{G,L] 



< 1 + 5, 
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Our algorithm is based on a recursive procedure which relates the number of list colorings of a given 
graph/list pair in terms of the number of list colorings of some reduced graph/list pairs. 

Given a pair (G, L) and a node v € G, let vi, . . . ,Vm he the set of neighbors of v. For every pair 
(k, i) € {1, . . . , m} X L{v) we define a new pair (G^,, Lk^i) as follows. The set of nodes of G is Vfe = V\{v} 
and Lk^i{vr) = L{vr) \ {i\ for 1 < r < /c, Lj^j{u) = L{u) for all other u. Namely, we first delete node v 
from the graph. Then we delete color i from the lists corresponding to the nodes Vr,r < k, and leave 
all the other lists intact. 

Lemma 1. The graph/list pair (G^jL^j) satisfies (0j for every 1 < k < m,j L{v), provided that 
(G, L) does. 

Proof. When we create graph from G the list size of every remaining node either stays the same or 
is reduced by one. The second event can only happen for neighbors vi, . . . ,Vm of the deleted node v. 
When the list is reduced by one the degree is reduced by one as well. Since a > 1, the assertion follows 
by observing that \L{vi^)\ > aA{vk) + /? implies |i^(t'A:)| — 1 > a{A{vk) — 1) + f3. □ 

The basis of our algorithm is the following simple result. 

Proposition 2. Given a graph/list pair {G,L) and a node v, suppose A{v) = m > 0. For every 
i G L{v) 

TP , , ni<.<ji-iPc..L,.(cw = o) 

^'^^'^'^ "'^ " E,..(.)ni<.<„(i -Po„,L,,w-) = J))- ^ ^ 

The recursion as well as the proof is similar to the one used by Weitz in [?], except we bypass the 
construction of a self-avoiding tree, considered in [?]. 

Proof. Consider a graph/list (G^,, L) obtained simply by removing node v from G, and leaving L intact 
for the remaining nodes. We have 

Pjj_^(c(u) =i) 
Z{G,L,c(v) = i)Z-\G,L) 



E,-6i(,)Z(G,Z,c(^;)=i)Z-i(G,L) 
Z{Gv, L, c{vk) ^ i, 1 < k < m) 

T,jeL{v) ^(^v, L, c{vk) ^j, I <k<m) 
IPG„,L(c(^fc) ^i,l<k<m) 



m 



Now, for every j £ L{v) 

2<fc<m 

We observe that -Lij = L for every j (no colors are removed due to the vacuous condition r < 1), and 
+ / j, 1 < r < fc) = PG„,L,,,(c(^fc) + j). Namely 

l<k<m, l<k<m 

Substituting this expression we complete the proof. □ 
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3.2 Upper and lower bounds 

The condition ([2]) allows us to obtain the following simple bounds. 
Lemma 2. For every G, L, node v and a color i G L{v) 

Proof. Observe that given an arbitrary coloring of the neighbors vi,...,Vm of v, there are at least 
|L(u)| — A{v) > (3 colors remaining. Then the upper bound holds. □ 

From this simple bound we now establish a different upper bound and also a lower bound using the 
triangle free assumption. 

Lemma 3. There exist eo = eo(a) € (0,1) and j3 > Q such that for every G,L, node v and a color 
i G L{v) 



^.^^^^^)=^)^2A(.)(l + eo)' 



We note that the upper bounds of this lemma and Lemma [2] are not comparable, since values of 
A(f ) could be smaller and larger than /?. 

Proof. We let vi, . . . ,Vm denote the neighbors of v, m = A{v) and let u^j, denote the set of neighbors 
of Vk, other than v for k = 1, . . . ,m. We will establish that for any coloring of nodes (ffcr), which we 
generically denote by c, we have 

q-\l-(3-^)^<F^r{c{v)=z\c)< 



2m(l +eo) 



The corresponding inequality for the unconditional probability then follows immediately. Now observe 
that, since the girth is at least 4, then there are no edges between Vk- Then ¥^ j^{c{v) = i\c) is the 

probability ¥j{c{v) = i) that v is colored i in a depth-1 tree T = {v,vi, . . . ,Vm}, where the lists L{vk) 
of Vk are obtained from L{vk) by deleting the colors used by the neighbors v^r by coloring c. From 
the assumption ([2]) we have that the remaining lists L{vk) have size at least |L(ffc)| — A{vk) > P each. 
Let ti = Fj{c{v) = i). For each color j G L{v) let tj^k = l/l-^('yfc)| if j S L{vk) and = otherwise. 
Proposition [2] then simplifies to 

ni<fc<m(l ~ , 1 .p-N 
^ T^ (5) 



Yjj(^L{v) ni<A;<m(f ^hk) J2jeL(v) Y[l<k<mi^ 

for every i G L{v), where ni<fc<m defined to be equal to unity when m = 0. From the equality part, 
applying tj^k < 1//?, we get 

t.>\Liv)\-Hl-P~'r>q-\l-p-T, 

and the lower bound is established. 

We now focus on the upper bound and use the inequality part of ([5]). Thus it suffices to show that 

Yl 11(1 - ^ 2(1 + eo)m (6) 

jeL{v) k 
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for some constant eo > 0. Using the first order Taylor expansion for logz around z = 1, 

n n ^'^-''^'^ 

l<k<Tn l<k<m, 
l<k<m 

for some < 9j^k ^ ^j.fci since —1/z'^ is the second derivative of log z. Again using the bound tj^k ^ 1//?, 
we have (1 — 0j,fc)^ > — l//?)^- We assume that /? is a sufficiently large constant ensuring (1 — 1//?)^ > 
1/2. Thus we obtain the following lower bound 

l<k<'m l<k<m 

where Tj stands for J2k^j,k- Then 

where we have used an inequality between the average arithmetic and average geometric. Finally we 
observe 

E^^ = E*^>= E E ^7^^ = "^- 

3&L{v) j,k l<k<mj^L{v^) \^\"k)\ 

Thus 

E n ~ - \L{v)\e'^^^^^^^^ > {am + /3)e~«'^^^^^ > ame~"^^"^^^ 

j&L{v) l<k<m 

The condition a > a** implies that there exists a sufficiently large /3 such that ae ° > 2. We find 

— — ("1+—") 

< eo < .1 such that ae r^' = 2(1 + eo)- We obtain a required lower bound ([6]). □ 



4 Algorithm and complexity 
4.1 Description of an algorithm 

Our algorithm is based on the idea of trying to approximate the value of Pj^ z,{c{v) = i), by performing 
a certain recursive computation using (jH) a fixed number of times d and then using a correlation decay 
principle to guarantee the accuracy of the approximation. Specifically, introduce a function $ which 
takes as an input a vector {G,L,v,i,d) and takes some values ^{G,L,v,i,d) € [0,1]. The input 
(G, L, V, i, d) to ^ is any vector, such that such that f is a node in G, i is an arbitrary color, and d is 
an arbitrary non-negative integer. Function <I> is defined recursively in d. The quantity $ "attempts" 
to approximate j^(c{v) = i). The quality of the approximation is controlled by d. We define <I> as 
follows. For every input (G, L,v,i,d) such that i ^ L{v) we set ^{G, L,v,i,d) = 0. Otherwise we set 
the values as follows. 



• When d = 0, we set ^{G, L,v,i,d) = l/\L{v)\ for every input (G, L,v,i). (It turns out that for 
our application the initialization values are not important, due to the decay of correlations). 
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• For every d > 1, if A{v) = 0, then ^{G, L,v,i,d) = l/\L{v)\ for all i G L{v). Suppose A{v) = 
m > and vi, . . . , Vm are the neighbors of v. Then for every i G L{v) we define 

r • ^^ ■ r 1 1 \\i<k<mi'^-H'^v.Lk,i,Vk,hd-l)) 1 

=min — — =jf — — — — — -- . (7 

The last part of the expression inside min[-] corresponds directly to the expression of Propo- 
sition [2l Specifically, if it was true that ^{G^, Li^j^v^, j,d — 1) = P^^ ,(c(wa;) = j), then, by 
Lemmas I2|3l the minimum in ([7]) would be achieved by the third expression, and then the value 
of <I>(G, L, V, i, d) would be exactly P^. ]^{c{v) = i). 

We will use the correlation decay property to establish that the difference between the two values, 
modulo rescaling, is diminishing as d ^ oo. Note that the computation of ^ can be done recursively in d 
and it involves a dynamic programming type recursion. The underlying computation is done essentially 
on a tree of graph list pairs Gs,Ls generated during the recursion. We refer to this tree as computation 
tree with depth d. 

We now describe our algorithm for approximately computing Z(G, L). The algorithm is parametrized 
by the "quality" parameter d. 

Algorithm CountCOLOR 

INPUT: A graph/list pair {G,L) and a positive integer d. 
BEGIN 

Set Z = 1,G = G,L = L. 

While G^%, find an arbitrary node v (z G and a color i G L{v) . Compute 

p{v,i)^<^iG,L,v,i,d). (8) 

Set Z = p~^{v,i)Z,G = G \ {v},L{u) = L{u) \ {?} for all neighbors of f in G, and L{u) 
remains the same for all other nodes. 
END 

OUTPUT: Z. 



4.2 Some properties 

We now establish some properties of 

Lemma 4. The following holds for every G, L,v,i G L{v), d > 0. 

*(G,L,„,M)<mm[i,5j5^-L^], (9) 
Y, <f(G,£,i',i,(!) < 1, (10) 

^G,L,v,i,d)>q-\l-l/(3)^. (11) 

Proof. (llOp follows directly from the definition of To show ([9]) we consider cases. For d > 1 this follow 
directly from the recursion For d = 0, this follows since ^{G, L,v,i,0) = 1/|-L(t;)| < l/{aA{v) + (3) 
and 2(l-|-eo) < 2.2 < a. We now establish (llip . For the case d = this follows since l/|L(u)| > 1/q. For 
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the case d > 1 this fohows from the recursion ([7]) since 1/(2(1 + eo)A(u)) > 1/q and the third term 
inside the minimum operator is at least q~^{l — l//3)^, using upper bound <I>(G, i, d — 1) < 1/(3 
which we have from ([9]). 

□ 



4.3 Complexity 

We begin by analyzing the complexity of computing function $. Recall that n = max(|y|, \E\,q) is the 
size of the instance. 

Proposition 3. For any given node v, the function $ can he computed in time 2'^('^('°s ll^ll+^°s A))^ 
particular when d = O(logn), the overall computation is 2*-^*^^°^ If in addition the size of the largest 
list \\L\\ is constant then the computation time is polynomial in n. 

Proof. Let T{d) denote the complexity of computing function $(-,(i). Clearly, T(0) = 0(|[L||). We now 
express T{d) in terms of T{d — 1). Given a node v, in order to compute $(G, -L, i, d) we first identify 
the neighbors t^i, . . . , oi v. Then we create graph/list pairs Gy, Lj^^, 1 < k < m, j & L{v), compute 
$(-,d — 1) for each of this graphs, and use this to compute ^{G, L,v,i,d). The overall computation 
effort is then 

T(d) = 0(||L||AT(d-l)). 

Iterating over d we obtain T{d) = 0{\\Lf+^A'^) = 0(2('^+i)(i°s 11^11+1°^^)) = 20Wi°g 11^11+^)). When 
d = O(logn), we obtain a bound 2*-^^^°^ If in addition ||L|| = 0(1), then the assumption ([2]) implies 
A = 0(1), and then r(d) = n<^(i). □ 

The following is then immediate. 

Corollary 1. Suppose d = O(logn). Then the complexity of the algorithm CountCOLOR is 2^^"^°^^ 
If in addition the size of the largest list ||L|| is constant, then CountCOLOR is a polynomial time 
algorithm. 



5 Correlation decay 

The following is the key correlation decay result. 

Theorem 2. Consider a triangle-free graph/list pair {G,L) satisfying There exist constants 

< e < 1 which depend only on a, such that for all nodes v, colors i € L{v) and d > 



max 

ieL{v) 



logF^L{c{v)=i)-log^G,L,v,i,d) <0{n\l-ef). (12) 



This theorem is our key tool for using the values of ^ for computing the marginals ¥^ l{c{v) = i). 
We first establish that this correlation decay result implies our main result, Theorem [TJ 

Proof of TheoremUl We consider an arbitrary instance (G,L) with size n and arbitrary 6 > 0. We 
may assume without the loss of generality that n is at least a large constant bigger than C/5, for any 
universal constant C, since we can simply extend the size of the instance by adding isolated nodes. The 
proof uses a standard idea of approximating marginals ¥^ j^{c{v) = i) and then using Proposition [T] 
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for computing Z(G,L). From Proposition [H if the algorithm CountCOLOR produces in every stage 
k = 1,2, . . . ,\V\ — 1 a value p{v, i) which approximates ^^('^(^'s) ~ ^) '^ith accuracy 



n 



p{v,i) 



{c{vk) 



5 

< 1 + - 
n 



(13) 



then the output Z of the algorithm satisfies 



1 - - 

n 



< (1-- 

n 



Z(G,L) / 5 
< ^ ^ ' < 1 + - 
Z \ n 



Since \V\ < n and n is at least a large constant, we obtain an arbitrary accuracy of the approximation. 
Thus it suffices to arrange for (|13p. We run the algorithm CountCOLOR with d = \ ^ " ] , where e is 

the constant from Theorem [2j This choice of d gives (1 - e)'^ < Xjn^. Theorem El with the given value 
of d then implies 



log' 



i) 



Thus 



p{v,i) 



1 - O(^) < exp 



log 



1 



^{G,L,v,i,d) 
p{v,i) 



Oik 



<exp(0(-i)) = l + 0(-i 



This gives us (jlSp for all n > C/6 where C is the universal constant appearing in O(-). This completes 
the analysis of the accuracy. The complexity part of the theorem follows directly from Corollary [TJ □ 

The rest of the section is devoted to establishing this Theorem [2j The basis of the proof is the 
recursion As before, let vi,. . . ,Vm be the neighbors of w in G, m = A{v). Observe that (fT2|) holds 
trivially when m = 0, since both expression inside the absolute value become l/|L(f)| and the left-hand 
side becomes equal to zero. Thus we assume that m > 1. Denote by the degree of Vk in the graph 
Gy. In order to ease the notations, we introduce 

Xi,k = JPc^.^fe = ^ ^ ^(^)' 1 < ^ < "I 

X* = ^{G,L,v,i,d), ieL{v), 

= ^{Gy,Lk,i,Vk,i,d - 1), i e L{v) nLi^kivk)A <k<m 



Proposition 4. There exists a constant e > which depends on a only such that 
1 1 

— max I log(xi) - log(a;*)| < (1 - e) max 1 log{xj^k) - ^og{x* ,,)\ (14) 

m i&L{v) jGL(t)),fc:nife>0 mfc ' ' 

First we show how this result implies Theorem [2) 
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Proof of Theorem [H Applying this proposition d times and using the fact that we are summing over 
k : mk > 0, we obtain 

— max \log(xi) - log(x*)l < Mil - e)'^, 

m i£L{v) 

where 



M = max 

Ls 



log P^^ (civ) =1)- log $(G„ Ls,v, I, 0) 



and the maximum is over all graph/list pairs Gs,Ls appearing during the computation of $ and over 
all colors I. Recall that if I does not belong to the list associated with node v and list vector Lg, then 
Pjj^ £j^{c{v) = I) = ^{Gg, Ls,v,l,0) = (the first is equal to zero by definition, the second by the way 
we set the values of $). Otherwise we have from Lemma [2] and part (|lip of Lemma [J] that absolute 
value of the difference is at most 

log<? + Alog(/?/(/?-l)). 

Since m < A < n, /3 is a constant which only depends on a, and q < n, then we obtain M = 0{n) and 
mM = 0(n2). □ 

Thus we focus on establishing Proposition HI 

Proof of Proposition^^ Observe that for every i G L{v) \ Lkj{vk) we have Xj^^ = x*^ = 0. This is 
because the probability of node obtaining color i is zero when this color is not in its list. Similarly, 
the corresponding value of $ is zero, since we set it to zero for all colors not in the list. For every 
i G L{v) introduce 

Ai^ W {l-xi,k) (15) 

l<k<m 

and 

A, (16) 
Introduce A*, A* similarly. Applying Proposition [2] we obtain 

1 A* 

mm 



2(1 + eo)m' ^' A* 
Let 

A* 

X* = — 

We claim that in order to establish p4|) it suffices to establish the bound 

— |logXi - logx*| < (1 - e) max — I log(xj- fc) - log(x • fc)| 



(17) 
(18) 
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Indeed, if x* / x*, then x* 



mm 



1 



2(l+eo)m.' 13 



, 4]. On the other hand, by Lemmas I2I3I we have Xi < 



™l^L2(l+.o)m'^ 

We have 



, 4], implying Xi < x* < x* , and the bound for x* imphes the bound p^ . 



max I log(xj) — log(2;,*)| = max 

ieL{v) ieL{v) 



log Ai - log A* - log A + log A* 



(19) 



We introduce auxiliary variables m = log{xi),yi^k = log(xi^/c). Similarly, let y* = log(x*), y*i^ = log(x*^). 
Define y = {yi,k),y* = iUik)- Observe that if ruk = then for every color i xi^k = x*j^. This follows 
since both values are l/|Lj fc| when i S Lj^^ and zero otherwise. This implies yi^k = y*^. Then we rewrite 
(Hm as 



max \yi-yi 

i&L{v) 



^ ™r^A ~ exp(yi,fe)) - V log(l - exp(y*;.)) 

k:mf,>0 fc:mfe>0 

E n (l-exp(y,,fc)))+log( J2 n (l-exp(y*fc)) 

j€L{v) l<k<m j&L{v) l<k<m 



(20) 



where the sums X]i<fc<m were replaced by X]A:-mfc>o "^^^ *° ^^'^ observation yi^k = y*jt when ruk = 0. 

For every i denote the expression inside the absolute value in the right-hand side of equation ([20 
by Qi{y). That is we treat y* as constant and y as a variable. It suffices to prove that for each i 



Gi{y) < (1 - e) max — | log{xj k) - log(x • ,,) | 

jeL{v),k:mk>Omk' 



(21) 



Observe that Gi{y*) = 0. Let gi{t) = Qi{y* + t{y — y*)),t € [0, 1]. Then gi is a differentiable function 
interpolating between and Qi{y). In particular, 5i(l) = Qi{y)- Applying the Mean Value Theorem we 
obtain 

|5i(l)-5i(0)| = \gi{l)\ < sup Mt)\ 

0<t<l 



sup 

0<t<l 



Vg^{y* + t{y-y*)f{y-y* 



where the supremum is over values of t. We use a short-hand notation 

nj = n ~ exp(2/j,fc + t{yj^k - y],k))) 

l<k<m 

For each t we have 

V7rt*^,f *\\( *\ \^ -^Myi,k + t{yi,k-ylk)) , 

V^.(y +tiy-y)){y-y ^ = i - exp(,,, + 1(.., -,;,)) ^^'-^ " ^-^^^ 

H 



Again using the fact yj^k = Vj^k when ruk = 0, we can replace the sum X]i</c<m ^k:mk>o ™ ^'^^ 
expression above. For each j we have from convexity of exp 

exp(yj,fc + t{yj^k - y]^k)) < (1 - exp(y* ;,) + t exp(yj- fc) 

= (1 - t)Xj^k + tX* f, 

1 

< 



2(1 + eo)mfc 
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where the last inequahty follows from Lemma [3] and part Q of Lemma HI This bound is useful for 
terms with > (for this reason we only kept these terms in the sum X]A;m.fc>o)- Similarly using 
Lemma [2] and again part ([9]) of Lemma S] we obtain 



1 

< 



1 - exp(yj,fc + - y* fc)) 1 - (1 - t) exp(?/*^) - t exp{yj^k) 

1 



1 - (1 - t)Xj^k - tx* 

< 



J<k j,k 

1 



We obtain 



sup 

0<t<l 



'^Giiy* + t{y-y*))iy-y*) < Yl n — Uon^ ^ \yi,k-ylk 

. ^ „ (1 - 4)2(1 + eojmfc 



k:mi.>0 ^ /3 



^ EjeL{v)Ek:mk>oi'^ - ^) ^ (1 + ep) \yj,k-ylk\^j 

m \yj,k - ylk\ 

< 7- — ■ max 



(1 - ^)2(1 + eo) ieL(i;),fc:mfc>o mfc 
+ r- — ; max 



(1 - i)2(l + eo) jei(«),fc:mfc>o 



m \Vhk-y]^k\ 
max 



(1 - i)(l + eo) jeL(i.),fc:mfe>o mfc 
Combining with ()20p we conclude 

bi-yri ^ 1 \yhk-ylk\ 

max < -, max 



i(iL{v) m (1 - i)(l + eo) iGL(f),fc:mfe>0 mfc 

We now select a sufficiently large constant 13 = /3(eo) such that 

1-e^ <1. 

(l-^)(l + eo) 

This completes the proof of Proposition HI □ 



6 Extensions: Markov random field and partition function 

6.1 Model and the preliminary results 

The main conceptual point of this paper, namely construction a recursion of the form construction of 
a corresponding computation tree, establishing correlation decay property and application to a counting 
problem, can be extended to an arbitrary model of random constraint satisfaction problems with multiple 
values. In this section we provide details using a very general framework of Markov random fields 
(MRF), also known as graphical model [?],[?]• We show that generalizing (jj]) is straightforward. It is 
establishing the decay of correlation which presents the main technical difficulty. We provide a simple 
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and general sufficient condition and then illustrate the approach on specific statistical physics problem, 
namely g-state Potts model. Here we restrict ourselves for simplicity to MRF defined on simple graphs. 
Extensions to multi-graphs are possible as well. 

A Markov random field (MRF) is given as a graph G with node set V = {vi, . . . edge set 

E, an alphabet X and set of functions (f)^ : X -.^ M+, v ^V, fy^u '■ I^+i iv,u) G E. Consider a 

probability measure on Af'^' defined by 

P(X = x) = , 



for every x = (x^) S A''^', where Z = Hi-ey 4'v{xv) Yl(v u)&E fv,u{xvi Xu) is the normalizing constant 
called the partition function. Here X = {X^) is the random vector selected according to this probability 
measure. In the case Z = 0, the MRF is not defined. From now on assume that 
Hvfzv 4>v{xv) 'n.{v,u)(^E fv,u{^v,Xu) > for at least one x = (x^) G ^I^L 

Let us see that the problem of list-coloring can be cast as a Markov random field, where P(-) 
corresponds to the uniform probability distribution on the set of valid colorings. Given an instance 
of a list-coloring problem {G,L) with a universe of colors {1, . . . ,<?}, we set X = {1, . . . ,q}, (pvii) = 
l{i € L{v)} for all node/color pairs v,i, and fv,u{}-,3) = 7^ j}-, where !{•} is the indicator function. 
It is not hard to see that ¥'{x) = \/Z \i x corresponds to a valid coloring and = otherwise, and 
Z = Z{G, L) is the total number of valid list-colorings. Thus this MRF corresponds to the uniform 
distribution on the set of proper colorings. 

An instance of a MRF is denoted by M = (G, X, (j), /), with (p = {(pv), f = ifv,u)- We will write Pm 
and Zm for the corresponding probability measure and the partition function, respectively, in order to 
emphasize the dependence on the particular instance of the MRF. Computation of Zm is the principle 
goal of this section. As in the case of list-coloring model, denote by .^m[x] the sum of the terms in the 
partition function which satisfy some condition x- 

Observe that if cpy, f^^u > for all nodes and edges than Pm(^ = a;) > for every x = {xv),v £ V. 
Moreover, if f^^u = c for all edges for some constant c, then we obtain a product form solution 

^MiX=x)=l[^M^y 

Thus we might expect the correlation decay to take place when the values of f^^u are close to each 
other. This is the regime within which we will establish our results. Let (pmin = ^^'<^v,x (t>v{x),4'max = 

maXy^^ (py{x) and = (pmn^/cpmin- Also let /min = T^^^(v,u)&E,x,y&X fv,uix, y), /max = ^aX^v,u)eE ,x,yeX fv,u{ 

and let Cf = /max //min- From now on we assume that the following conditions hold 

4)y{x)>0, yveV,xeX (22) 
/^,,«(a;,y) > 0, y{v,u) e E, x,y e X (23) 

These conditions in particular ensure that c/ < oo. The following condition will be used in lieu of ([2]) 

^ {cf - c]^)A\X\'^ < 1. (24) 

The size of an instance M is 

n = max(^\V\, \E\, |log 0maxl, |log ^^-jj, | log /max|, I log /^i^l) . 
We now state the main result of this section. 
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Theorem 3. There exist a deterministic algorithm which provides an FPTAS for computing Zm for an 
arbitrary MRF instance M satisfying 121^). (2^), \2-^^ , whenever \X\ and A are constants. 

Our first task is obtaining a generaUzation of the cavity recursion given by Proposition [H Given a 
MRF M = (G, Af, (/), /), an arbitrary node v and an arbitrary element x* G <Y we consider a new MRF 
instance T^^x* [M] = (G, /) defined as follows. The graph G is the subgraph of G induced by all 

nodes other than v. X = X and f = f ■ </> is defined as follows. For every u which is a neighbor of 

1 

V, (t>u{x) = (pv^"^ fv,u{x* ,x), where A(t;) is the degree of v in G. For all the remaining nodes u we set 
4'u = (pu- 

Given a MRF M = (G, X, (p, f) let vi, . . . , v^v\ be an arbitrary enumeration of nodes. Consider an 
arbitrary x* = (x*^, . . . such that F{x*) > 0. Define Mq = M and = 7;^,^jjMfc_i], k = 

1, 2, . . . , where M|y| is an empty MRF and its partition function is set by default to unity. 

Proposition 5. The following identity holds. 

Zm= U F^\jX,^=x*J. 

l<k<\V\ 

Proof. We have 

X^a:eArl^'l:a;„, =x* 'i'vi{Xvi)Yiu:(vi,u)eEfvi,u{x^-^,Xu)Y[u^vi 'f'(Xu)Y[(v,u)eE, v,u^vi fv,u{Xv,Xu) 

Fm{X,, =x*J = '-^ 

_ Uu^vi (I^Uxu)U {v,u)£E, v,u^vi fv,u{Xy,Xu) 
_ Zmi 

Zm ' 

where the second equality follows since (pix^^^) Ylu {vi u)eE f{x%^,Xu) = Hm (t'^ixu) and the second prod- 
uct is over neighbors u of vi in G. Iterating further for k >2 we obtain the result. □ 

The identity in Proposition [5] provides an important representation of the partition function in terms 
of marginal probabilities. Thus, if we compute (approximately) these marginal probabilities, we can use 
them to obtain the value of the underlying partition function. 

6.2 Basic recursion and the algorithm 

Our next task is constructing a generalization of (G^, and extending Proposition [2] to MRF. Given 
a MRF M = (G, X,(p, f) and a node v let denote the MRF instance obtained naturally by removing 
node V. Namely, we keep and fu,w intact for all the nodes u ^ v and edges {u,w),u,w ^ v. Also, 
given a MRF M = (G, X, (j), f), a set of nodes vi, . . . ,Vr C V and a set of elements xi, . . . ,Xr € X we 
construct a MRF denoted by M[i;i, xi; . . . -jVrjXr] = (G, cp, f) as follows. The corresponding graph G is 
the subgraph induced by nodes V \ {vi, . . . ,Vr}. For every node u € G which has at least one neighbor 
among vi, . . . ,Vr we set 4>uix) = Hi fvi,u{xi, x)4>u{x), where the product is over z = 1, 2 . . . , r such that 
{vi,u) is an edge in G. For all the remaining u we set (pu = (pu- We also set f = f. The interpretation 
for M[i;i, xi; . . . ; Vr,Xr] comes from the following simple fact. 

Lemma 5. For every event 8 corresponding to the probability measure Pm; the following holds 

F'm{£\ ^k<r = Xk) = F'M[vi,xi;...;Vr,Xr]{^) ■ 
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Proof. The proof is obtained immediately by summing over all of the elementary events x ^ £ and 
observing that the terms 4>yj,{xk) cancel in the ratio Pm(^ A /\k<rXy^ = x^) /"^ M{f\k<rX.u^ = Xk)- □ 

Observe that the value of Cf corresponding to the MRF Wl[vi, xi; . . . -jVr, Xr] is the same of M. 
Thus, should M satisfy conditions (p2]) . (p3]) .(p4 |) . so does the instance M[fi,a;i; . . . -jVrjXr]- Moreover, 
defined for this MRF satisfies 

< c^cf (25) 

Now we obtain a recursion which serves as a basis for our correlation decay analysis and construction 
of an algorithm. 

Proposition 6. For every node v and its neighbors vi, . . . ,Vm, the following identity holds for every 
xq G X: 

TO \ ^v{xo)Y,xi,...,x,r,&xIYk=lfv,Vk{^0:XkWMlvi,xv,...-,Vk-i,Xk-i]i^Vk=^k) 

Fm(A„ — Xq) — ^ — — — -, [Zb) 

l^xeX 9v[X) l^xi,...,x„i(^X llfc=l Jv,Vk\X, Xk)irM[vi,xi;...;Vk^i,Xk_i]\^v^. — Xk) 

where the sum V!^. ^. = 1 when m = 0. 

Proof. The case m = is immediate. Assume m > 0. For every xq (z X we have the following identity 

Pm(X = Xq) = '^"(^"^^^iv.^'^e-y ^"'"[^^1 =Xl,...,X^^ = Xm]Uk'=lfv,Vki^0^^k) 
'l2x£X '^v{x)J2xi,...,Xm(^X ^mA^vi = Xl, . . . ,Xjj^ = ^m] nfe=l 

We divide both parts by Zm„ and write 

Zm^ [Xy-^ = Xl, . . . , Xy^ = Xm] _ -TT [Xy^ = Xl, . . . , Xy^ = Xk] 

z« ~ ii 



k: 



~^ ZUyiXy-^^ = Xl, . . . ,Xyf,_-^ = Xk-l] 



where the term corresponding to A; = is identified with Zm„- Applying Lemma [5l we recognize the 
A;-th term in this product as ^M[vi,xi;...;vk-i,Xk-i](-^vk = ^k) (note that the terms 4)y.{xj),j < k — 1 
cancel out). 

□ 

Proposition [6] also allows us to obtain upper and lower bounds on the marginal probabilities: 
Lemma 6. For every node v and xq ^ X 

-A (l^vixo) fv - ^ / A 4>vixo) 



22xMx) 22x^v{x) 



Proof. The proof follows from Proposition [H We have for every x € X, node v and its neighbors 
vi,...,Vm that < nr=i fv,Vk{x,Xk) < cjf^^^. Applying this bound to the numerator of ([26]) for 
x = Xq we obtain the required upper bound. Applying the same to the denominator, we obtain the 
required lower bound. □ 

We now provide sufficient conditions under which the construction of a computation tree for com- 
puting approximately marginal probabilities PM(-'^t) = x) as well as the partition function Zm can be 
performed in polynomial time. 

Similarly to the problem of coloring, we introduce <!>(•) - a surrogate for computing the marginal 
probabilities Pm(')- Consider a function ^M{v,x,d) defined recursively for an arbitrary instance of a 
MRF M = (G, X, (p, /), arbitrary node v, element x (z X and a non-negative integer d as follows. 
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• We set ^m{v,x,0) = 1. As in the case of coloring, it turns out that the initiahzation values are 
not particularly important, due to the decay of correlations. 

• For every node v with neighbors vi, . . . , Vm, every xq ^ X and d>l 

. , '^'^(^0)Exi,...,x™6;tnr=l^Mbi,xi;..,^;fc_i,Xfe_i]K,a;fc,d- l)^^,(xo,Xfc) 

where the sum Ylxi xra<^x~^ when m = 0. 

Assumptions (1221) and (f23l) guarantee that <I> > 0. 

We now describe our algorithm for approximately computing Zm- The algorithm is parametrized 
by d. It is based on computing recursively the values of <I>m- 

Algorithm ComputeZ 

INPUT: A MRF instance M = (G, , /) and a positive integer d. 
BEGIN 

Set Z = 1,M = M. 

While G 7^ 0, fix an arbitrary node G G and element x^X. Compute ^^{v,x,d). 
Set Z = ^~^{v,x,d)Z . 

Set M = 7^^2.[M], where the operator T was defined before Proposition [5]. 
END 

OUTPUT: Z. 



6.3 Complexity 

We begin by analyzing the complexity of computing function 

Proposition 7. For every v G V,x E X , the function ^m{v , x , d) can he computed in time O {2'^^^°^^'^^ n^) 
In particular when d = O(logn), and \X\, A = 0(1), the computation is polynomial in n. 

We note that the dependence on A is not as nice as in the case of the list-coloring problem, as it 
appears as A not log A in the exponent. Thus we can no longer claim that the computation time is 

20(log2n) ^j^-g p^gg_ 

Proof. Let T{d) denote the complexity of computing function $(-,(i). Clearly, T(0) = 0(1). We now 
express T{d) in terms of T{d — 1). Given a node v, in order to compute (^Miv^x^d) we identify the 
neighbors fi, . . . , of f . For every sequence xi, . . . , Xm G and every A; = 1, 2, . . . , m we compute 
^M[vx,xx;...\vt,_i,Xk--i]{'^k-,Xk,d — 1). The computation of each such quantity is T{d — 1). We use the 
obtained values to compute <I'm('^5 a^, d) via ([27|) . We also need O(n^) time to "take care" of multiplying 
by fv,Vk and by (j)^. The overall computation effort then satisfies 

T{d) = 0{\X\^T{d-l)+n^). 

Iterating over d we obtain T{d) = 0{\X\'^^n^) = 0(2'^^'°sl'^ln^), and the first part is established. When 
d = O(logn) and A, \X\ are constants, we obtain T{d) = n^^^\ □ 

The following is then immediate. 

Corollary 2. Suppose d = O(logn) and \X\,A = 0(1). Then ComputeZ is a polynomial time algo- 
rithm. 
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6.4 Correlation decay analysis 

We now establish a correlation decay result which is a key to proving our main result, Theorem [3l 

Theorem 4. Given an arbitrary MRF satisfying conditions (2^). [23\). [24\ ), the following holds for every 
node V and d > 1 



max 



logPM(-'^t. = x) - log$M(^^,a;,d) 
<(l-7) 



max 

l<k<m,y<^X 



vi,xi;...;vt-i,Xk. 



=y) 



[vi,xi;...;Vk-i,Xk. 



_^^{vk,y,d-l) (28) 



where vi, . . . , Vm are the neighbors of v. 

We first show how this theorem implies our main algorithmic result. 

Proof of Theorem [5l We claim that ComputeZ provides FPTAS for computing partition function Zm 
when d = O(logn) under the setting of Theorem [31 We have already established in Corollary [2] that 
the algorithm is polynomial time. 

Consider any MRF instance M obtained during the computation of $m(") as a part of performing 
algorithm ComputeZ. Applying ()25|) and Lemma [6] we obtain that for every node v in M and every 
X e X 



x) > Cf 



1 -Ad -1 



-A(d+1) 1 -1 



\X\ 4' f \X\ f <^ ~ f \X\ 't' ' 

Then applying the result of TheoremlHd times and recalling ^fj{v, x, 0) = 1, we obtain for d = O(logn) 

logFM{X^ = x)-\og^M{v,x,d)\ < (1 -7)'^(A(d+l)logc/ + log|;f| + logc^) 

= (1 -7) '^0((in log n) 



1 



0(n log^ n) 



n 



1 



o{i) 



n 



where the last step is obtained by selecting d = C log n for sufficiently large constant C. 



^M{v,x,d) 



1 



< exp(n 



1 



1 



n 



We conclude that ^Miv,x,d) provides an approximation of marginal probability ¥m{X^ = x) with an 
inverse polynomial error. The remainder of the proof is the same as for Theorem [2j □ 

Proof of Theorem\^ Fix a node v and an element xq G X. Let vi, . . . ,Vm be neighbors of v. When 



m = we the left-hand side of (j28p is zero. Thus assume m > 0. In order to ease the exposition 
we introduce some notations. Set z = logPM(-'^t. = xq), = logPM[«i,a;i;...;Dfe_i,Xfe_i] (-^^t^fe = Xk). 

Similarly z = log ^>m(w, xq, d), Za;i,...,Xfe = ^og^M[vi,xi;...;vk-i,Xk-i]i'^k,Xk,d - 1). Also let z denote the 
vector {zxi,...,x^.), 1 < k < m, xi, . . . , Xm £ X and z denote the vector {zx-^^...^Xk)^ 1 < k < m, xi, . . . , Xm € 
X. Both vectors have dimension Yli<k<m l*^!™- Then we can rewrite (p6]) as 



log 



,Xm&X W.k=l fv,Vk{xo, Xk) exp(Za;i, 



Y^xeX 't>v{x) Yjxi,...,XmeX Y\k=l fv,Vf,{x, Xk) exp(Za;i,...,Xfe) 



(29) 
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and rewrite ([^T]) as 

Introduce a function Q defined on a vector w = {wxi,...,Xk), 1 < ^ < m,xi, . . . ,Xm € X with the same 
dimension J2i<k<m \^\"^ ^ foUows: 

_log Mxo)Ex^,...,x,^GxUT=ifv,vkixo^Xk)eMwxi,...,xk) ^^^^ 

which we rewrite as 

log0^,(xo) + logt/i(w) - log ^2 (if) 
where the definition of Qi and ^2 is immediate. 

m 

Xl,...,XmGA' fc = l 

m 

,v,,{x,Xk)exp{wx-i^,...,Xk) 

XdX X\,...,Xm^X k = \ 

We have z — z = Q{z) — Q(z). Thus estabhshing (p8|) reduces to showing 

|a(;^)-a(i)I<(l-7)ll^-^l|L^. 
Applying Mean Value Theorem, there exists t € [0, 1] such that 

z-z = Vg{tz + (1 - t)z)'^{z - z) 

further implying 

\z-z\< ||Vg(tz + (l-t)z)||i,||z-z)|U^. 

It then suffices to establish 

||Vg(tz + (l-t)z)||L, < 1-7. 

In the following lemma we show that this bound holds for an arbitrary input vector w and thus complete 
the proof of Theorem IH □ 

Lemma 7. For every vector w 

\\vg{w)\\L, < 1-7- 

Proof. Fix an arbitrary sequence x?, • • • ,x^^ G X and the corresponding variable ifa;0,...,xO • We have 



^1 7^— --^2 



■^I'-'-'-^feg -^I'-'-'-^feg -^I'-'-'-^feQ 
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We have 



dw„o „o 

Using /min < fv,vk{xo,xl) < Cf f rain, we obtain 

.1 Ogl ^ (nfcLieXpK0,„„,0)) Ex,+„...,Xrr.exUT=ko+l^MWxu...,X,) 



<c7 
<4. 



Similarly, we obtain 



'>"^a;i,...,a;fc 



Using again /min < fv,u{x,y) < Cffmin we also obtain 



(Ea:e;(^ '/'^(^)) (rifcii expKo^...,^o)) Ex,+i,...,x^eAr IlLiko+i expKi,...,c.J 

{i2x€x Mx)) i2x,,...,xm€x nr=i expKi,...,^J 

(rifcli cxp(u;^o,...,^o)) Ex,+„...,x,„ex rifcUo+i (^Mwx^,...,xk) 



Hxi,...,Xm&X rifcLl ^'^V{Wxi,...,Xk) 



<cf. 



Similarly, 



g-i ^ 

Since the dimension of the argument w is J2i<k<m i'^l™ < ^ll-'^ll'^) then we conclude 

\\VG{w)\\L,<{cf-cJ^)A\X\^<l-r 
This concludes the proof. □ 
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6.5 Example: Potts model 

One of the most widely studied objects in the statistical physics is g-state Potts model. It is described 
in the terminology of MRF as follows. Given a graph G we set </>^ = 1 for all nodes v. X = {1, 2, . . . , q}. 
A parameter /? called inverse temperature is fixed. The coupling functions / are set as fu,v{x-,y) = 
exp(/31{j; = y}) for all nodes u,v and all elements x,y G X. The case /3 > corresponds to the 
ferromagnetic Potts model. In this case the distribution Pm(") "favors" assignments which select the 
same element along the edges. The case /3 < corresponds to the anti-ferromagnetic Potts model, and 
in this case the distribution favors assignments with different elements along the edges. The extreme 
case (3 = —oo corresponds to the usual coloring problem, where monochromatic coloring are simply 
forbidden. The special case g = 2 is called Ising model - one of the cornerstone models of the statistical 
physics. 

It is immediate that conditions (I22p and (I23p are satisfied by this model provided < oo. Thus 
an immediate corollary of Theorem [3] is the following algorithmic result. 

Corollary 3. There exists a deterministic FPTAS for computing a partition function for a family of 
Potts model (G, q, j3) with constant constant degree A, constant number of colors q, and satisfying 

Observe that for large A, the largest inverse temperature (3 satisfying this condition behaves like 
0{-^-T^). We believe that this is an overly conservative estimate. We conjecture that in fact the 
correlation decay property can be established in the regime 

/3 = 0(^), (32) 

leading to a deterministic FPTAS. 

7 Comparison of the correlation decay on a computation tree and 
the spatial correlation decay property 

As we have mentioned above, the (spatial) correlation decay is known to hold for the coloring problem in 
a stronger regime a > a* ~ 1.763 . . ., then the regime a > a** considered in this paper [?]. This decay of 
correlation is established in a conventional sense: for every node v the marginal probability P(c(f ) = i) is 
asymptotically independent from changing a color on a boundary of the depth-d neighborhood B{v,d) 
of V in the underlying graph. In fact it is established that the decay of correlation is exponential 
in d. It is natural to try to use this result directly as a method for computing approximately the 
marginals F{c{v) = i), for example by computing the marginal FB{v,d){c{v) = i) corresponding to the 
neighborhood B{v,d), say using brute force computation. Unfortunately, this conventional correlation 
decay result is not useful because of the computation growth. In order to obtain e- approximation of 
the partition function, we need order 0{e/n) approximation of the marginals, which means the depth 
d of the neighborhood B{v,d) needs to be at least O(logn). Here n is the number of nodes. But 
the resulting cardinality of B{v,d), even for the case of constant degree graphs is 0(A'°s") = n'-^^^^ - 
polynomial in n and the brute-force computation effort would be exponential in n. Notice that even if 
the underlying graph has a polynomial expansion \B{v,d)\ < d^' , for some power r > 1, the brute- force 
computation would still be 0(exp(log'^ n)) which is super-polynomial. This is where having correlation 
decay on computation tree as opposed to the conventional graph theoretic sense helps. 
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8 Conclusions 



We have established the existence of a deterministic approximation algorithm for counting the number 
of list colorings for certain classes of graphs. We have further extended our approach to constructing 
deterministic approximation algorithm for computing a partition function of a Markov random field 
satisfying certain conditions. Along with [?] and [?] this work is another step in the direction of 
developing a new powerful method for solving counting problems using insights from statistical physics. 
This method provides an important alternative to the existing MCMC sampling based method as it leads 
to a deterministic as opposed to a randomized algorithm. Since the conference version of the paper 
[?] appeared, several new developments happened in this direction. A deterministic approximation 
algorithm for counting the number of partial matchings in constant degree graphs was constructed in 
Bayati et al. [?]. The result was further used by Gamarnik and Katz [?] for constructing a deterministic 
sub exponential algorithm for computing a permanent of an arbitrary 0, 1 matrix. Recently Nair and 
Tetali [?] introduced a somewhat different way of constructing a computation tree, closer to the original 
self-avoiding walk based construction of Weitz [?]. Furthermore, they established that a strong form 
of correlation decay (called very strong spatial mixing in the paper) implies correlation decay on the 
computation tree and ultimately leads to a polynomial time algorithm for computing a partition function 
of a MRF. Their setting also allows for a hypergraph structure. It would be interesting to use their 
result to perhaps tighten the condition ([2]) used in the present paper. 

The principle insight from this work, along with the work of Weitz [?] is the advantage of establishing 
the correlation decay property on the computation tree as opposed to the original graph theoretic 
structure. While we have established such correlation decay only in the regime a > 2.8432..., we 
conjecture that it holds for much lower values of a. In fact, just as it is conjectured that the Markov 
chain is rapidly mixing in the regime q > A + 2, we conjecture that the correlation decay on the 
computation tree holds in this regime as well, at least for the case of constant number of colors q. 
Finally, we conjecture that the polynomial time algorithms for computing partition function of a MRF 
can be constructed under weaker a assumption than (p^ . Specifically, we conjecture that for the case 
of Potts model, the critical inverse temperature /?* under which the correlation decay can be established 
on a computation tree behaves like (f32|l . 
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